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1.  INTRODUCTION 


i u i. 


Let  x, , x-,...,  x be  independent,  normally  distributed  observations  with 
I Z n 

2 

common  variance  o and  (under  Hq)  with  common  mean  y.  Many  authors,  [1,3,4,5,6,9,10,11, 

12,13]  have  been  interested  in  the  problem  of  testing  for  outlying  observations,  (i.e., 

discovering  which  of  the  observations,  if  any,  come  from  a population  with  a mean 

different  than  p).  Grubs  [ 6 ] gives  an  excellent  review  of  the  literature  up  to 

1967.  He  discusses  procedures  which  are  recommended  for  the  following  situations: 

2 

1)  0 is  known 

2 2 

2)  a is  not  known  but  an  estimate  of  a is  known  which  is  independent  of 
the  sample  observations. 

3)  The  only  available  information  is  the  sample  observations. 

For  (3)  above,  Grubbs  discussed  procedures  for  the  cases  when  there  are: 

(i)  One  spurious  observation  on  the  high  (low)  side. 

(ii)  Two  spurious  observations  on  the  high  (low)  side. 

(ill)  One  spurious  observation  on  the  high  side  and  one  on  the  low  side. 

Grubbs  remarks  that  "these  techniques  are  not  generally  recommended  for  repeated 
rejection,  since  If  several  outliers  are  present  in  the  sample  the  detection  of  one 
or  two  spurious  values  may  be  'masked'  by  the  presence  of  other  anomalous  observations". 
In  other  words,  if  there  are  more  than  two  spurious  observations,  the  procedures 
given  will  not  be  apt  to  detect  the  one  or  two  "most"  spurious  observations. 

David  and  Paulson  [ 2 ] discuss  the  performance  of  several  of  the  tests  for 
outliers  under  the  alternative  that  there  is  one  spurious  observation.  McMillan  and 


David  [10  ] discuss  the  performance  of  the  two  procedures  when  o is  known  and  under 
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the  alternative  that  there  are  two  spurious  observations.  McMillan  [ 9 ] compares 

2 

the  performance  of  three  procedures  when  a is  not  known  and  under  the  alternative 

that  there  are  two  spurious  observations. 

Tietjin  and  Moore  [ 12  ] discuss  procedures  for  detecting  outliers  when  k of  the 

n observations  may  be  considered  suspect  with  k<n/2.  They  state  that  "suspected 

observations  sometimes  form  subgroups;  i.e.,  several  values  are  closer  to  each  other 

than  they  are  close  to  the  bulk  of  the  observations.  This  phenomenon  makes  sequential 

procedures  Insensitive.  It  has  been  called  the  masking  effect."  They  propose  two 

statistics  designed  to  detect  multiple  outliers  for  the  case  when  an  independent 
2 

estimate  of  o does  not  exist. 

Guttman  and  Smith  [7]  have  been  the  only  authors  who  have  considered  the  problem 
2 

of  estimating  0 when  spurious  observations  are  present.  They  considered  the  problem 
when  there  is  at  most  one  spurious  observation  and  when  p is  unknown,  only  for  a 
sample  of  size  n~3. 

2 

The  estimation  of  0 is  a very  inportant  problem.  If  the  data  analyst  has  a 
2 

"good"  estimate  of  o , he  could  use  it  to  help  judge  which,  if  any,  of  the  observations 

are  outliers  and  how  many  outliers  there  are.  Almost  all  of  the  tests  for  outliers 
2 2 

are  based  on  0 or  some  estimate  of  a , thus  it  is  clear  just  how  important  it  is 

2 2 

to  have  a "good"  estimate  of  o . New  estimators  of  o are  being  proposed  here 
and  some  of  their  properties  are  being  examined. 


2.  NOTATION  AND  MOTIVATION 


Let  x,,  x„,...,  x be  Independent  normally  distributed  observations  with  a 
1 4 n 

2 

common  variance  0 . Suppose  that  k^  of  the  observations  come  from  a population  with 

mean  p+A  and  k^  ■ n-k^  of  the  observations  come  from  a population  with  mean  p.  If 

2 2 

kj  - 0,  the  uniformly  minimum  variance  unbiased  estimator  of  a is  given  by  s - 

2 2 
ECXj-x)  /(n-1).  Note  that  one  can  also  calculate  s by  the  formula 

s2  - E”-1Ij_1(x1-Xj)2/2n(n-l).  This  is  an  interesting  way  to 

2 

look  at  s for  if  one  of  the  observations  were  an  outlier,  say  x^,  then  the  absolute 


t 
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* 


differences  |k^-x^|,  ij*q  would  be  large  in  comparison  to  the  absolute  differences 

not  involving  x^.  If  one  knew  that  x was  a spurious  observation  and  the  only  spurious 

2 

observation,  the  uniformly  minimum  variance  unbiased  estimator  of  0 would  be  given 
by 

^i-l^j-l  cij(VXj)2/2(n-1)<n-2) 
where  c^“0  if  i“q  or  if  j*»q  and  c^j*l  otherwise. 


More  generally,  suppose  x^  , x^ 


come  from  the  population  with  mean 


U+A  and  x 


M ^2  v*k1 

come  from  the  population  with  mean  p.  Let  A'fq^^,. . . ,q^  } 


and  B-{p^,p2, . . • ,Pk  }.  Then  the  uniformly  minimum  variance  unbiased  estimator  of 

2 2 
o is 


zi-izj-icij  (*rxj)"/2[ki(kr1)+k2(k2'1)1 


where  c^  - 1 if  i e A,  j e A or  if  i e B,  j e B and  c^  = 0 if  i e A,  j e B or  if 
i e B,  j e A. 

An  intuitive  feeling  for  what  could  be  done  is  to  exclude  from  the  sum, 
n n 2 

E^jE^tXj-x  ) , those  terms  for  which 


Xi-Xj 


is  quite  large,  Bay  larger  than 


some  prespecified  constant  c.  That  is,  it  seems  reasonable  that  a "good"  estimator 
2 

of  o can  be  based  on  the  statistic 

Qc  " Ei-i£J«icij<xi“xj>2  where  cij  " 1 if  lxi_xj I and  cij  " 0 lf  lxi“xjl>c- 
This  is  similar  to  a successful  procedure  introduced  by  Johnson  and  Graybill  [ 8 ] 

2 

for  estimating  0 and  locating  outliers  in  the  two-way  classification  model. 

Of  many  methods  of  choosing  c,  there  are  two  which  are  appealing.  First,  if 

2 

the  experimentor  has  some  prior  knowledge  about  the  relative  size  of  0 , he  could 

choose  c accordingly.  That  is,  c could  be  chosen  as  some  multiple  of  0 such  as 

c s 2 Jl  0.  This  particular  value  would  eliminate  those  differences  from  the  sum 

which  are  more  than  two  standard  deviations  apart. 

A second  method  of  choosing  c is  to  choose  c such  that  a given  percentage  of 

the  absolute  differences  are  excluded  from  the  sum.  To  do  this  let  » Ix^-x^l  for 

n and  let  u/..Au,,.2><>2  u be  the  ordered  absolute  differences. 

*2'  (n(n-l)/2) 


One  can  then  let  c ■ u.  , for  some  r,  lSr£n(n-l)/2.  If  there  are  k,  outliers  in  the 

in)  i 

data,  there  will  be  kj^  conPar^80ns  which  should  be  excluded  from  the  sum.  Thus, 

the  obvious  choices  for  r which  have  been  considered  are  l'(n-l),  2*(n-2),  3'(n-3),...  . 

2 

In  the  next  section  new  estimators  of  0 are  proposed  and  some  of  their  properties 
are  discussed.  In  some  cases  their  properties  are  examined  by  exact  methods  and  in 
other  cases  the  properties  are  examined  by  Monte-Carlo  methods. 


3.  ESTIMATORS  OF  0 

3.1  Method  1 Estimators 

Let  Qc  * iJ.ilJ.jCy  (x1-xj)2/2n(n-l)  where  c^  *=  1 if  Ixj-x^^c  and  c^  = 0 if 
|xj-Xj|>c  and  where  c is  some  prespecified  constant. 

First  the  expected  value  of  is  obtained.  The  expected  value  of  Qc  is  ob- 
tained by  making  repeated  applications  of  the  following  lemma. 

2 

Lemma  3 . 1 Suppose  V/VN(6,  0 ).  Let  G be  the  random  variable  defined  by 
G»1  if  )w|lc  and  G-0  if  lw|>  c.  Then 

E (GW2 ) - 02[(l+62/02)(l-z((c-6)/0)-z((c+6)/0)  - /=(^exp{-(c-6)2/202}  + 

,00 

exp{-(c+6)2/2G2}]  where  z(a)  = j (I//* 2ff)  exp{-x2/2}dx. 


Proof : 


UGW2) 


-c 


w f^ (w)  dw 


-c 


To  evaluate  the  integral  on  the  right  above,  let  v * (w-6)/0,  then 
r(c-6)/0 

/2? 


E(GW  )- 


2 2 

e v The  result  above  is  obtained  from 


-(c+6)/0 
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this  by  straight  forward  integration. 


The  expected  value  of  Qc  is  given  by  the  following  theorem. 

Theorem  3.1  Suppose  x^  are  independent  normally  distributed  random 

2 

variables  with  a common  variance  o . Suppose  that  k^  of  the  x^  come  from  a population 
with  mean  y + X and  the  remaining  ■ n-k^  of  the  x^  come  from  a population  with 
mean  y.  Then 


E(V  - SiS=I) 


2 . . 2 


{(kx  + k2  - n)  (l-2z (c/o/2)  - (c/o/ir)  e 


-c2/4o2. 


+ 2k1k2[(l+A2/2o2)(l-z((c+X)/a«6)  - z((c-X)/o/2)) 
- ^ ((c+X)  e'(c"A)2/4°2 -(c-X)  e"(c+A)2/4cf2)]> 


(3.1) 


Proof:  Without  any  loss  of  generality  it  can  be  assumed  that  x^»x2 x^  have 

mean  y + X and  x^+j,  » * " * » xn  have  mean  y.  Let  S^  = {1,2,...,  k^ } and 

s2  ■ {kj+1,  kj+2 . . . , kj  + k2>. 


E(2n(n-l)Qc)  - (x^)2] 


Ei-lEj-lElCij*Xi"xJ*  1 


L E[c  (x.-x  )2]  + £ E[c. , (x  -x. )2] 

i,jLS1  J iE8ltJCS2lJ  j 


+ £ E(c . . (x.-x. )2 J + l Efc  (x.-x  )2] 

ies2,jes1  J i,jes2  J 

- Hj  + H2  + H3  + H4  (say). 


Each  of  the  terms  in  and  where  i^j  have  x^-x^ ''44(0,20  ),  thus,  according  to 


Lemma  3.1, 
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ElCij  (Xi-Xj >2]  * 2o2[l-2z(c/a/2  )-  (c/o/ft)e  C (say) 


for  every  term  in  and  H^. 

2 2 
Each  term  in  H2  has  xi~Xj'''N(X,2o  ) and  each  term  in  H ^ has  x^-x^tK-A , 2o  ) 

hence,  according  to  Lemma  3.1, 


EIcij (xi”xj  > 1 


1 

/2v 


c+A 

o/2 


- 2a2[(l+A2/2a2)(l-z((c-A)/a/2)  - 

-(c-A)2/4o2  . c-A  -(c+A)2/4a2  ' 

e + — — e 

o/2 


z((c+A)/a/2) 
] = G2  (say) 


for  every  term  in  and  H^. 


Therefore 


E[2n(n-l)Qc]  - k1(k1-l)G1  + kjk^  + kjk^  + k2(k2-l)  Gy 
‘ (kx2  + k22-n)  Gx  + 2kLk2G2 

2 2 

= 2 o2  { (kL2  + k22-n)(l-2z(c/o/2)  - (c/a/it)  e~c  /4°  ) 

+ 2kxk2 [ (1+A2/2o2) (1-z ( (c-A) /o/2)-z ( (c+A )/a/2) ) 


1 

/2v 


— -(c-A)2/4«t2  — - 

0^2  e + o/2 


-(c+A)2/4a2 


]} 


The  deelred  result  follows  from  this. 

The  following  two  corollaries  are  useful  in  choosing  an  estimator 

2 

of  o based  on  Q . 

c 
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Corollary  3.1  If  A«0  or  equivalently  if  k^“0,  then 

2 2 

E[Q  ] - a2(l-2z(c/o/2)  - (c/a/ir)  e"c  /4a  ) 


Corollary  3.2 


Lim  E[Qc ] = 


2 2 

k,  + k,  -n  2..  2 . 

1 2 [ l-2z (c/o/2)  - (c/o/F)  e"C  /4°  ] o. 


2 ti  — 9 

Theorem  3.2  Let  - ^i-l^Xi_^  / (n— 1 ) . Under  the  assumption  given  in  Theorem  3.1, 

k.  k 

-,-2.  2 *1*2  ,2 
E(S  ) * a + —7 — r>  A . 

n(n-l) 


2 2 2 

Tables  1 and  2 give  values  of  E(Q ^/a  ) and  E(S  /o  ) for  some  different  values  of 
n,  k ^ , A,  and  c. 

1.  E(Qc/02)  and  E(S2/o2)  when  n = 10. 


m 

J 2a 

2/2 a 

3/2a 

k/l a 

E< 

ki 

1 

2 

3 

m 

2 

3 

m 

2 

3 

m 

2 

3 

H 

A=0 

.199 

.199 

.199 

.739 

.739 

.739 

.971 

.971 

.971 



.999 

.999 

A-3o 

, 170 

. 148 

.132 

.761 

.779 

.792 

1 . 390 

1.716 

1.949 

1.782 

2.392 

1.9 

A“6o 

.159 

.128 

.106 

.598 

.488 

.410 

.916 

.874 

.843 

1.696 

2.239 

4.6 

A-9o 

.159 

.128 

.106 

.591 

.476 

.394 

.777 

.627 

.519 

.823 

.687 

.590 

9.1 

A*  12a 

.159 

.128 

.106 

.591 

.476 

.394 

.777 

.626 

.518 

.799 

.644 

.533 

15.4 

Am  <*> 

.159 

.128 

.106 

.591 

.476 

.394 

.777 

.626 

.518 

.799 

.644 

.533 

OO 

.2  ,_2V 
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2.  E(Qc/o2)  and  E(S2/02)  when  n = 25. 


■ 

/la 

2 /2a 

3/2 o 

4/20 

2 2 
E(SVo  ) 

u 

■ 

2 

3 

m 

2 

3 

■1 

2 

3 

1 

2 

3 

1 

2 

3 

A*0 

.199 

.199 

.199 

.739 

.739 

u 

H 

.971 

.999 

.999 

.999 

1.0 

1.0 

l.C 

A*3o 

.187 

.177 

.167 

.748 

.764 

1.432 

1.312 

1.600 

1.861 

1.375 

1.690 

1.9 

A«6o 

.183 

.168 

.155 

.683 

.584 

.949 

.929 

.911 

1.278 

1.534 

1.766 

2.44 

3.76 

4.9 

■ A=9o 

.183 

.168 

.155 

.679 

.625 

.576 

.893 

.822 

.758 

.929 

.864 

.806 

4.24 

7.21 

9.9 

.183 

.168 

.155 

.679 

.625 

.576 

.893 

.822 

.757 

.919 

.846 

.779 

6.76 

12.04 

16. 8 

j A=  » 

.183 

.168 

.155 

.679 

.625 

.576 

.893 
. 

.822 

.757 

.919 

.846 

.779 

oo 

OO 

OO 

Upon  examination  of  Tables  1 and  2 one  can  see  that  Qc  is  not  affected  as  much  by 

2 2 

outliers  as  is  the  usual  estimator  of  O , S . Unfortunately,  Qc  usually  under 
2 

estimates  O . Examination  of  these  two  tables  shows  that  one  must  be  careful  not  to 
pick  c too  small.  The  most  optimal  choice  appears  to  be  somewhere  around  3/2o  and 
4 /2a.  Although  it  would  be  possible  to  divide  Qc  by  a constant  to  reduce  the  bias, 
this  will  not  be  done  here  because  other  estimators  are  being  proposed  which  have 
more  desirable  properties. 


3.2  Method  2 Estimators 


Let  u 


lj 


for  i<j  - 1,  2,...,  n and  let  U(1)  > u (2)~  u(n(n-l)/2) 


be  the  ordered  values  of  the  . If  there  is  one  outlier  in  the  data  (i.e.  k^“l) , 
it  is  reasonable  to  hope  that  the  n-1  differences  between  the  outlier  and  the  re- 
maining observations  will  be  larger  than  the  (n-l)(n-2)/2  differences  between  the 
remaining  n-1  observations.  Hence,  if  these  n-1  differences  are  removed  from  the 


sum  of  all  squared  differences,  a reasonable  estimator  of  a would  be  given  by 
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vi  ■ Cl  »u,)«<»-i)o-2) 

or  equivalently  by 

V1  = ^i<jUij  “ ^i*l  U(i)^n“1^n”2^’ 

2 

More  generally,  for  k^  outliers  an  intuitive  estimator  of  a is  given  by 
Vkj  “ (^i<jUij  " ^i=l  U(i)  + k2(k2-1))‘ 

In  order  to  examine  the  properties  of  these  estimators  and  to  evaluate  how 
"good"  these  estimators  are,  1000  random  samples  of  size  10  were  generated  for  each 
of  several  cases.  This  was  done  for  k^  *=  1,  2,  3,  5 and  X/a  = 0,  3,  6,  9,  12.  The 
results  are  presented  in  Tables  3-6. 


r 


I 


I 
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3.  The  mean,  variance,  and  mean  square  error  of  V^. 


a 

A/o 

E (V^o2) 

Var (Vx/o2) 

MSE^/o2) 

0 

.5057 

.0594 

.304 

ii 

3 

.7948 

.1209 

.163 

■ 

6 

.9897 

.2091 

.209 

m 

9 

1.0005 

.2549 

.255 

IS 

12 

1.0116 

.2551 

.255 

2 

3 

1.2081 

.2060 

.249 

2 

6 

3.2801 

.6686 

5.867 

2 

9 

7.0790 

1.5713 

38.526 

2 

12 

12.6224 

3.1477 

138.228 

3 

3 

1.6060 

.3687 

.736 

3 

6 

5.2718 

1.4831 

19.732 

3 

9 

11.936 

3.5253 

123.138 

3 

12 

21.5141 

6.7534 

427.580 

5 

3 

1.9605 

.4909 

1.413 

5 

6 

7.0175 

2.2324 

38.442 

5 

9 

16.0074 

6.0760 

231.299 

5 

12 

29.0490 

10.4993 

797.248 

E(V2/o2) 

.. 

Var(V2/o2) 

mse(v2/o' 

| 

----- 

.3058 

.0233 

.505 

.4595 

.0475 

.340 

.5244 

.0683 

.294 

.5208 

.0761 

.306 

.5270 

.0756 

.299 

.6618 

.0765 

.191 

.9505 

.1996 

.202 

1.0159 

.2685 

.269 

.9914 

.2685 

.269 

.9264 

.1225 

.128 

2.5836 

.3504 

2.858 

5.6753 

.9042 

22.763 

10.2552 

1.7572 

87.415 

1.1991 

.1812 

.221 

4.2833 

.8790 

11.659 

10.0579 

2.7473 

84.793 

18.6274 

4.8748 

315.601 

i 


5.  The  mean,  variance,  and  mean  square  error  of 


A/o 

e(v3/o2) 

Var(V3/o2) 

2 

MSE(V3/o  ) 

0 

,2039 

.0110 

.6448 

3 

.3022 

.0226 

.0301 

.5095 

6 

. 3374 

.4691 

9 

.3370 

.0347 

.4743 

12 

.3401 

.0335 

.4690 

3 

.4276 

.0375 

.3651 

6 

.5492 

.0826 

.2858 

9 

.5711 

.0913 

.2753 

12 

.5587 

.0934 

.2881 

3 

.5794 

.0574 

.2343 

6 

.9528 

.1761 

.1783 

9 

.9805 

.2710 

.2714 

12 

.9905 

.2793 

.2794 

3 

.7780 

.0727 

.1220 

6 

2.3389 

.2317 

2.0243 

9 

5.1775 

.6986 

18.1497 

12 

9.5095 

1.3469 

73.7588 
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Examlnation  of  Tables  3,  4,  5,  and  6 reveals  that: 

2 

(i)  under  estimates  o whenever  k ’k  ^ and  A Is  small. 

2 

(11)  is  approximately  unbiased  for  a when  A Is  larger  and  k“k^. 

2 

(111)  over  estimates  o whenever  k<  k^  and  A Is  not  small. 

2 

With  regard  to  (11)  above.  It  Is  easily  shown  that  E(V^)  o as 

A -*•  « when  k»k^.  If  A Is  large  and  the  number  of  outliers  Is  known  then 

2 

V.  is  the  "best"  estimator  of  o that  is  available. 

K1 

2 

Let  V*  ■ V /v. when  (V^/g  ).  Thus  V*  will  be  an  unbiased  estimator 

2 A«0 

of  o where  A*0;  i.e.  when  there  are  no  outliers  in  the  data.  For  those  cases 

« 2 * 
when  A?0  and  k^>  0,  V*  will  be  a conservative  estimate  of  o . That  is, 

2 2 

over  estimates  o . While  it  Is  true  that  S , the  usual  sample  variance, 

2 

Is  also  a conservative  estimator  of  o It  Is  generally  much  more  conserva- 
tive than  V^.  For  comparison  purposes  Table  7 gives  the  expected  value  of 

2 2 
V*  and  S for  several  alternative  models.  The  expected  values  of  S are 

exact  while  those  for  VJ  come  from  the  Monte-Carlo  Study. 

2 2 

Examination  of  Table  7 reveals  that  V*  is  less  biased  for  a than  S 

2 2 

when  and  Vg  is  less  biased  for  o than  both  S and  V*  when  k-^- 

Similarly  V*  is  less  biased  than  S^,  V*,  and  V*  for  k^i.  3 and  is  less 
2 

biased  than  S , V*,  and  V£,  and  V*  for  5.  Thus  from  the  above  it 

2 

would  seem  that  V£  should  be  the  preferred  estimator  of  a for  all  of  the 
alternative  cases;  however,  one  should  first  examine  the  variance  and/or 
the  mean  square  error  of  these  estimators  before  making  a final  decision. 

Since  all  of  these  estimators  sre  biased  except  when  A-0,  only  the  mean 
square  error  of  the  estimators  will  be  given.  Table  8 gives  the  mean  square 


I 
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7.  The  mean  of  S2,  V^,  V2*,  V^*,  and  V5*. 


K1 

\/a 

E(S2/o2) 

E(V  */ 02) 
1 

E(V  */o2  ) 
2 

E(V  Vo2) 
3 

E(V  */02  ) 
5 

- 

0 

1 

1 

l 

l 

l 

1 

3 

1.9 

1.572 

1.503 

1.A82 

1.A72 

1 

6 

A. 6 

1.957 

1.715 

1.655 

1.625 

1 

9 

9.1 

1.978 

1.703 

1.653 

1.627 

1 

12 

15. A 

2.000 

1.723 

1.668 

1.6A3 

2 

3 

2.6 

2.389 

2. 165 

2.098 

2.075 

2 

6 

7. A 

6.A86 

3. 109 

2.69A 

2.559 

2 

9 

15. A 

13.998 

3.323 

2.802 

2.638 

2 

12 

26.6 

2A.959 

3.2A2 

2.  7A1 

2.589 

3 

3 

3. 1 

3.176 

3.030 

2.8A2 

2.792 

3 

6 

9. A 

10. A2A 

8.A50 

A.67A 

A.  032 

3 

9 

19.9 

23.603 

18.561 

A. 810 

3.998 

3 

12 

3A.6 

A2.5A1 

33.5A0 

A. 859 

A.  092 

5 

3 

3.5 

3.877 

3.922 

3.816 

3.662 

5 

6 

11.0 

13.876 

1A. 009 

11.A73 

6.700 

5 

9 

23.5 

31.652 

32.895 

25. 397 

6.973 

5 

12 

A1.0 

57.AAO 

60.922 

A6.6A7 

6.915 
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8. 

The  mean  square 

2 * 

error  of  S , 

* * 

, v2  . v3 

and  V5*. 

K1 

A/o 

MSE(S2/a2) 

MSE(V*/a2)  MSE(V*/a2) 

1 2 

MSE(V*/o2) 

3 

MSE(V*/o2) 

5 

- 

0 

.22 

.23 

.26 

.26 

.29 

1 

3 

1.43 

.80 

.77 

.78 

.81 

1 

6 

14.78 

1.73 

1.26 

1.  15 

1. 14 

1 

9 

69.43 

1.95 

1.33 

1.26 

1.26 

1 

12 

213.98 

2.00 

1.36 

1.25 

1.24 

2 

3 

3.49 

2.73 

2.20 

2.  11 

2.  14 

2 

6 

44.03 

32.71 

6.64 

4.86 

4.44 

2 

9 

213.98 

175.09 

8.35 

5.44 

4.79 

2 

12 

666.96 

586.34 

7.98 

5.28 

4.68 

3 

3 

5.57 

6.17 

5.47 

4.77 

4.78 

3 

6 

74.52 

94.62 

59.35 

17.73 

13.22 

3 

9 

365.83 

524.69 

318.34 

21.04 

13.78 

3 

12 

1144.12 

1752.06 

1078. 17 

21.61 

14.76 

5 

3 

7.58 

10. 19 

10.53 

9.68 

8.96 

5 

6 

104.67 

174.52 

178.90 

115.26 

40.  14 

5 

9 

516.47 

963.33 

1047.49 

612.03 

46.80 

5 

12 

1618.00 

3226.57 

3644.23 

2116.09 

45.38 
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error  of  these  estimators.  Note  that  when  A-0,  MSE(V*)=»VAR(V*) . 

From  Table  8 it  is  interesting  to  note  that  when  A=0,  the  variances  of 

2 

the  estimators  V*,  V*,  V*,  and  V*  are  not  much  larger  than  the  variance  of  S . 

That  is,  when  there  are  no  outliers  in  the  data  one  does  not  lose  much  efficiency 

2 

by  choosing  one  of  the  estimators  V*,  V*,  V*,  or  V*  rather  than  S . This  seems 
to  be  a small  price  to  pay  for  the  reduction  in  bias  that  is  made  when  outliers 
are  in  the  data. 

To  summarize  the  results  in  this  subsection  a procedure  which  might  be 

used  when  one  desires  to  estimate  the  variance  of  a sample  of  size  10  which 

may  or  may  not  have  outliers  in  it  can  be  described  as  the  following: 

(i)  Decide  on  an  upper  bound,  k,  for  the  number  of  outliers  in  the 

sample.  If  the  experimentor  is  reluctant  to  do  this,  he  may  choose 

k“5;  i.e.  half  of  the  data  are  outliers  or  the  data  is  a mixture 

of  two  populations. 

(ii)  Calculate  both  and  V*. 

2 

(iii)  Conclude  that  o is  quite  likely  to  fall  somewhere  between  and 

vs- 

2 

With  this  new  information  about  a it  may  now  be  possible  to  decide 
whether  some  of  the  data  are  spurious  observations  or  not. 

4.  ESTIMATORS  OF  o 2 AND  A 

2 

In  this  section  the  problem  of  estimating  both  o and  A is  considered. 

2 

Estimators  of  o and  A are  defined  and  their  properties  discussed.  Through- 
out this  section  it  is  assumed  that  k^,  the  number  of  outliers,  in  the  data, 


is  known. 
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4.1  Two  equation  estimators 


Let  be  the  sample  variance  and  Qc  be  as  defined  In  section  3.1. 


How  then  E(S2)  - o2  + kjk^  X2/n(n-l)  and  E(Qc>  Is  given  by  ( ).  A 


pseudo-method  of  moments  estimator  of  0 and  A can  be  obtained  in  the  following 


2 '2  a 

Find  the  values  of  o and  A,  say  o and  A , such  that 

c c 


o2  + kjk^  X2/n(n-l) 


^ A A 

/-  * 1 


g^°  c*  * c*  wher*  8^  O , A)  - E(Qc). 


Unfortunately,  these  two  equations  do  not  always  have  a solution. 


In  this  case  a and  X are  found  so  that  the  sum  of  the  squared  residuals 
c c 


Is  minimized.  That  is,  O and  X obtained  so  that 

c c 


R . {s2-o2  - k^A2/"^1)!  + fQc  ~ \.)J  ls  “inimized. 


The  procedure  that  was  used  to  find  the  values  of  Oc  and  Ac  that 
minimize  R ls  a combination  of  the  Gauss-Newton  method  and  the  method  of 
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Steepest  Descent  for  estimating  the  parameters  of  non-linear  models. 

-2 

Tables  9-12  give  the  mean,  variance,  and  mean  square  error  of  o^  and  X^ 
for  various  values  of  c,  X,  and  from  the  Monte  Carlo  study. 

Examination  of  these  tables  show  that  X does  a "surprisingly  good" 

c 

job  of  estimating  X for  all  choices  of  c and  for  each  value  of  . In  terms 

of  estimating  o^,  however,  a value  of  c somewhere  around  2/2o  or  3/2 a seems  to 

2 

give  approximately  unbiased  estimates  of  a for  most  values  of  X and  k^.  In 

-2  2 

other  words,  in  order  to  use  a to  estimate  a , one  should  be  careful  in 

c 

selecting  the  value  of  c so  that  It  is  neither  too  small  or  too  large. 
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9.  The 

mean. 

variance , 

and  mean  square  error 

of  0 and  A 
c c 

when  = 

1. 

c/o 

X/o 

-2  2 

E(o  /o  ) 
c 

'2  2 

Var(o  /o  ) 
c 

-2 

0 

MSE(-~ ) 
0 

E(X  /o2) 
c 

Var(^) 

X 

MSE(— ) 

0 

JT 

3 

.411 

.058 

.405 

3.803 

1.402 

2.047 

JT 

6 

.345 

.025 

.454 

6.439 

1.122 

1.315 

ST 

9 

.342 

.025 

.457 

9.305 

1.105 

1.198 

JT 

12 

.348 

.025 

.449 

12.227 

1.197 

1.249 

1ST 

3 

1.045 

.281 

.283 

2.405 

2.446 

2.800 

2 JT 

6 

1.252 

.826 

.890 

5.624 

1.532 

1.674 

iJT 

9 

1.233 

.535 

.589 

8.785 

1.317 

1.363 

2/r 

12 

1.220 

.448 

.497 

11.845 

1.221 

1.245 

3/2~ 

3 

1.536 

.620 

.907 

2.425 

2.562 

2.893 

3/1' 

6 

1.043 

.449 

.450 

5.833 

1.573 

1.601 

3 JT 

9 

1.108 

.646 

.657 

8.880 

1.156 

1.170 

iJT 

12 

1.164 

.777 

.803 

11.879 

1.235 

1.250 

ujt 

3 

1.872 

.854 

1.614 

3.235 

2.321 

2.376 

ujt 

6 

1.213 

.603 

.648 

5.702 

2.541 

2.630 

uJT 

9 

.967 

.285 

.286 

8.971 

1.064 

1.065 

ujt 

12 

1.033 

.328 

.329 

11.938 

1.178 

1.182 
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10.  The  mean,  variance,  and  mean  square  error  of  0c  and  Ac  when  = 2. 


c/o 

A/o 

a?  2 

E(0  /O  ) 
c 

Var(o^/o2) 

MSE (oJ/02) 

E(Ac/0) 

Var(Ac/a) 

MSE(Ac/a) 

JT 

3 

.536 

.204 

.390 

3.239 

1.232 

1.289 

JT 

6 

.569 

.028 

.459 

6.240 

.631 

.687 

JT 

9 

.343 

.029 

.473 

9.  184 

.598 

.632 

JT 

12 

.333 

.223 

.627 

12. 123 

.634 

.649 

2J2 ~ 

3 

1.243 

.866 

.925 

2.509 

1.501 

1.742 

2/r 

6 

1.328 

1.636 

1.744 

5.730 

1.028 

1.101 

2 JT 

9 

1.254 

.584 

.649 

8.879 

.701 

.716 

2 IT 

12 

1.209 

.473 

.517 

11.927 

.655 

.660 

3^2~ 

3 

1.516 

.899 

1.165 

2.599 

1.286 

1.447 

3^2~ 

6 

1.055 

.861 

.864 

5.882 

1.010 

1.024 

9 

1.146 

.773 

.794 

8.921 

.621 

.627 

3^“ 

12 

1.171 

1.043 

1.072 

11.934 

.677 

.681 

3 

2.291 

1.490 

3. 158 

2.469 

1.605 

1.887 

k?r 

6 

1.398 

1.635 

1.793 

5.688 

1.903 

2.000 

uJT 

9 

.965 

.257 

.259 

8.983 

.590 

.590 

uJT 

12 

1.010 

.325 

. 326 

11.974 

.636 

.637 
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11. 


The  mean,  variance. 


~ 2 

and  mean  square  error  of  Oc  and 


when  k, 
c 1 


3. 


c/a  A/a  E(o2/a2)  Var(o2/o2)  MSE(o2/o2)  E(A „/o)  Var(A „/o)  MSE(A  /a) 


/2~ 

2 

.721 

.684 

/r 

6 

.389 

.271 

/r 

9 

.355 

. 197 

/T 

12 

.476 

2.575 

2/r 

3 

1.144 

1.067 

2/T 

6 

1.277 

1.857 

2/r 

9 

1.254 

1.286 

2/r 

12 

1.333 

3.513 

2/r 

3 

.862 

.737 

2/r 

6 

1.067 

. 746 

2/r 

9 

1.075 

.614 

2/r 

12 

1.199 

2.098 

u/T 

3 

1.250 

.758 

4 ZT 

6 

1.327 

1.651 

4 JT 

9 

.923 

.286 

k/T 

12 

1.008 

.325 

.761 

2.963 

1.556 

1.557 

.643 

6.  167 

.572 

.600 

.612 

9.110 

.486 

.498 

2.850 

12.042 

.621 

.623 

1.088 

2.771 

.797 

.849 

1.934 

5.794 

.822 

.864 

1.350 

8.901 

.581 

.591 

3.624 

11.884 

.566 

.579 

.756 

3.  192 

.733 

.770 

.750 

5.924 

.612 

.618 

.619 

8.933 

.463 

.467 

2. 138 

11.906 

.576 

.585 

.821 

3.374 

1.363 

1.503 

1.758 

5.853 

.862 

.878 

.292 

8.973 

.459 

.460 

.325 

11.949 

.470 

.473 

w 
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12.  The 

mean. 

variance. 

and  mean  square  error  of 

O and  A 

c c 

when  = 

5. 

c/a 

X/o 

*2  . 2 
E(ac/o  ) 

2 

Var(oc/o  ) 

MSE(o^,/o2) 

E(Ac/o) 

\ 

Var (Ac/o) 

MSE( A /a) 
c 

JT 

3 

1.089 

1.400 

1.408 

2.  718 

1.513 

1.593 

JT 

6 

.448 

.908 

1.213 

6.  152 

.578 

.601 

JT 

9 

.419 

.962 

1. 300 

9.092 

.482 

.490 

JT 

12 

.521 

5.406 

5.635 

12.043 

. 596 

.598 

2/T 

3 

1.195 

1.632 

1.670 

2.  774 

.888 

.939 

iJT 

6 

1.343 

2.353 

2.471 

5.836 

.776 

.803 

l/T 

9 

1.375 

2.768 

2.909 

8.903 

.746 

.755 

iJT 

12 

1.459 

7.303 

7.513 

11.902 

.645 

.655 

3/2~ 

3 

.865 

.620 

.788 

3.  152 

.614 

.637 

3 JT 

6 

1.107 

1.247 

1.  117 

5.939 

.636 

.640 

3 JT 

9 

1.051 

.414 

.644 

8.960 

.448 

.450 

3 JT 

12 

1.142 

1.834 

1.354 

11.952 

.441 

.443 

uJT 

3 

1.339 

1.104 

1.219 

3.286 

.854 

.936 

u/T 

6 

1.299 

2.560 

2.650 

5.915 

.753 

.760 

u/r 

9 

.890 

.202 

.212 

8.991 

.441 

.441 

4 JT 

12 

.979 

.224 

.224 

11.978 

.412 

.412 

4.2  Five  equation  estimators 

2 

The  estimators  of  o and  A that  are  proposed  in  this  section  are  similar 
to  those  in  the  last  section  except  that:  all  four  equations  for  are  used. 
That  is  o2  and  A are  found  so  that  the  sum  of  the  squared  residuals. 


Rx  = [S 


- o2-k1k2X2/n(n-l)]2  + HQj/2  " Sj/J  <o2,A)]2, 


is  minimized.  Once  again  the  procedure  that  was  used  to  find 
is  a combination  of  the  Gauss-Newton  method  and  the  method  of 
for  estimating  the  parameters  of  non-linear  models. 

Table  13  gives  the  mean,  variance,  and  mean  square  error 


the  values  of 
Steepest  Descent 

A2 

of  a and  X for 


various  values  of  X and  k^  from  the  Monte  Carlo  study. 
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13. 

The  mean. 

variance 

, and  mean 

square  error 

of  o and 

X. 

-2  2 

-2  2 

*2  2 

K1 

A/o 

E(o  /o  ) 

Var(o  /a  ) 

1 MSE(o  /o  ) 

1 E(A/o) 

Var (X/o) 

MSE(A/o) 

1 

3 

1.419 

.627 

.802 

1.589 

2.731 

4.722 

1 

6 

1.079 

.485 

.491 

5.781 

1.942 

1.990 

1 

9 

1.008 

.305 

.305 

8.950 

1.C59 

1.062 

1 

12 

1.069 

.352 

.357 

11.924 

1.165 

1.170 

2 

3 

1.201 

.906 

.947 

2.642 

.961 

1.089 

2 

6 

1.100 

.937 

.947 

5.855 

1.025 

1.046 

2 

9 

1.007 

.277 

.277 

8.972 

.578 

.579 

2 

12 

1.046 

.358 

.360 

11.967 

.623 

.624 

3 

3 

.761 

.490 

.547 

3.071 

.645 

.650 

3 

6 

1.084 

.704 

.711 

5.913 

.592 

.600 

3 

9 

.962 

.297 

.298 

8.965 

. 446 

.447 

3 

12 

1.045 

.365 

.367 

11.943 

.458 

.461 

5 

3 

.818 

.431 

.464 

3.058 

.509 

.512 

5 

6 

1.044 

.552 

.554 

5.966 

.441 

.442 

5 

9 

.938 

.212 

.216 

8.984 

.428 

.428 

5 

12 

1.009 

.245 

.245 

11.974 

.399 

.400 

-26- 


Examination  of  Table  13  shows  that  o and  X are  approximately  unbiased 

2 

estimates  of  o and  X for  all  values  of  k^  except  when  X=3.  I think  that  one 

reason  that  the  estimators  are  not  unbiased  when  X=3  is  that  the  values  of  C 

used  were  quite  large  with  respect  to  this  value  of  X and  in  this  case  the 

values  of  Q were  approximately  the  same  for  all  c greater  than  /2.  I believe 
c 

that  if  the  values  of  c were  chosen  smaller,  that  the  procedure  suggested  here 

2 

will  do  a good  job  of  estimating  a and  X for  small  values  of  X as  well. 

5.  SUMMARY  AND  CONCLUSIONS 

The  estimators  and  V*  are  quite  simple  to  calculate  and  give  a good 

2 

range  in  which  o will  probably  fall.  However,  tables  for  the  divisors  have 
not  yet  been  generated  for  any  cases  other  than  when  the  sample  size  is  10. 

If  a high  speed  computer  is  available,  I think  estimators  similar  to  those 
given  in  Section  4.2  hold  the  most  promise.  Different  methods  for  choosing  the 
cs  should  be  evaluated.  Perhaps  the  most  useful  would  be  choosing  a set  of  c's 
which  are  functions  of  the  sample  values  such  as  functions  of  the  range  of  the 
sample . 
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